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Abstract 

Quantum, in contrast to classical, information theory, allows for different incom- 
patible types (or species) of information which cannot be combined with each other. 
Distinguishing these incompatible types is useful in understanding the role of the two 
classical bits in teleportation (or one bit in one-bit teleportation), for discussing deco- 
herence in information-theoretic terms, and for giving a proper definition, in quantum 
terms, of "classical information." Various examples (some updating earlier work) are 
given of theorems which relate different incompatible kinds of information, and thus 
have no counterparts in classical information theory. 

I Introduction 

Despite an enormous number of publications in the field of quantum information (see [1,2] 
for useful introductions), neither the fundamental principles underlying the subject, nor its 
connection with classical information theory as developed by Shannon and his successors [3], 
is altogether clear. On the one hand there has been some dispute [4, 5] about whether 
Shannon's ideas can be applied at all in the quantum domain. On the other hand there 
have been suggestions that the connection with Shannon's ideas occurs only for macroscopic 
systems or asymptotically large N (number of transmissions, or whatever) limits, as in 
what is sometimes called "quantum Shannon" theory [6,7]. The author's position is that, 
to the contrary, there are perfectly consistent ways of applying the basic ideas of classical 
information theory to small numbers (even one) of microscopic quantum systems provided 
attention is paid to the Hilbert space structure of quantum theory, and probabilities are 
introduced in a consistent fashion. And, further, that this approach has advantages in 
that simple systems are simpler to think about than complicated systems, so it is useful 
to develop some intuition as to how they behave. One goal is to understand both classical 
and quantum information theory in fully quantum terms, since the world is (most physicists 
believe) fundamentally quantum mechanical. 

The basic strategy of this paper is based on the idea that quantum information comes in 
a variety of incompatible types or species. Each type or species refers to a certain class of 
(typically microscopic) mutually-compatible properties of a quantum system. As long as the 
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discussion of information is limited to a single type, all the usual formalism and intuition 
provided by classical information theory apply directly to the quantum domain. On the other 
hand, incompatible types of quantum information cannot be combined, as this makes no sense 
in the context of standard Hilbert space quantum mechanics. Since in classical information 
theory there is only a single type or species of information, or, equivalently, all different types 
are compatible with each other, the main respect in which quantum information theory needs 
to go beyond its classical counterpart is in relating incompatible types of information in a 
useful way. Various examples are given below. 

In the real (i.e., quantum) world it must, of course, be the case that so-called "classical" 
information, as in the acronym LOCC, "local operations and classical communication," is 
describable in quantum terms. A relatively precise definition can be given as indicated in 
Sec. IIVI classical information is a particular type of quantum information, the only one that 
survives under circumstances (implicitly assumed in much writing on the subject) where 
there is strong decoherence. 

The remainder of this paper is organized as follows. The concept of quantum information 
types is introduced in the context of a discussion of quantum incompatibility in Sec. [Ill The 
idea is illustrated by the simple examples of one- and (standard) two-bit teleportation in 
Sec. Illlt where using quantum information types and what we call the Presence theorem 
helps understand why one or two bits, respectively, are needed in these protocols, or "dits" 
in their dimensional ("qudit") generalizations. Decoherence and "classical" information 
are the subjects of Sec. IIVI which begins with a simple beam splitter example that illustrates 
the importance of the Exclusion theorem, derived from a more general Truncation theorem, 
and this sets the stage for a proper understanding of classical information in quantum terms. 

Quantum information theory requires theorems that relate different types of information, 
and hence go beyond anything in classical information theory. Those used in Sees. IIIII 
and [IV] and some others closely related to them are given precise formulations in Sec. |Vl 
extending earlier work in [8]. They all have the "smell" of no-cloning [9], but the connection is 
not altogether straightforward, as shown by an additional Generalized No-Cloning theorem. 
Proofs and some additional technical details are found in the appendices. A summary and 
an indication of various ways the present work needs to be extended comprise the concluding 
Sec.lVD 

II Types of Information 

Central to the following discussion will be the notion of quantum incompatibility [10], 
which can be illustrated using the familiar two-dimensional Hilbert space of a spin-half 
particle. Each one- dimensional subspace or ray, which is to say all complex multiples of a 
fixed nonzero ket \w), is associated with the property that a particular component of angular 
momentum is positive, Sw = +1/2 (in units of h) for some direction w in space, e.g., w = z 
or w = X or w = —z, etc. The negation of the property (or proposition) = +1/2 is the 
property 5^ = —1/2, corresponding to the orthogonal complement of the ray associated with 

= +1/2. In the notation commonly used in quantum information theory, Sz = +1/2 and 
— 1/2 correspond to rays passing through (i.e., multiples of) |0) and |1), respectively, and of 
course these are orthogonal, (0|1) = 0. 
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It always makes sense to talk about the conjunction P AND Q of two properties P and 
Q of a classical system, such as p > and a; < for a harmonic oscillator. The result may 
be a property that is always false, as when P is p > and Q is p < 0, and in this case the 
negation (NOT P) OR (NOT Q) of P AND Q is the property j9 < OR p > 0, which is 
always true. But in quantum theory it is possible to write down conjunctions, such as 



which make no sense. Obviously ([I]) cannot correspond to any ray in the Hilbert space, since 
each ray is associated with = +1/2 for some direction w, and ([T]) is not of this form. Could 
it be a proposition that is always false? Then its negation = —1/2 OR = +1/2 must 
always be true, which does not seem very plausible. Indeed, assuming that ([1]) and similar 
conjunctions are false swiftly leads to a contradiction if one follows the usual rules of logic — 
for details, see Sec. 4.6 of [11]. This was understood by Birkhoff and von Neumann [12], who 
proposed altering the rules of propositional logic to get around this difficulty. Their proposal 
has not been of much use for interpreting quantum mechanics, which may merely mean that 
we physicists are not smart enough. By contrast, if one restricts the domain of meaningful 
discourse so as to exclude ([1]) and similar things — in particular, conjunctions (AND) and 
disjunctions (OR) of properties corresponding to projectors that do not commute — it is 
possible to produce a consistent interpretation of quantum mechanics [11,13-20] that follows 
the usual rules of logic (as applied to meaningful statements) , and resolves all the standard 
paradoxes [21]. 

The compatible propositions = +1/2 and = —1/2, corresponding to mutually 
orthogonal projection operators, form a quantum sample space of mutually exclusive possi- 
bilities: their conjunction is always false, and since each is the negation of the other, one or 
the other is always true. This makes physical sense in that one can in principle carry out 
a Stern-Gerlach measurement to determine whether Sz = +1/2 or —1/2 [22]. (By contrast, 
there is no measurement which can determine the truth or falsity of ([1]), as one would ex- 
pect for something that is meaningless.) Information that answers the question of whether 
Sz = +1/2 or —1/2 is what we shall call the Z type (or species) of information. Similarly, 
X information answers the question whether = +1/2 or —1/2. It is incompatible with 
Z information in that there is no way in which the two can be meaningfully combined: 
([1]) makes no sense, and asking whether Sz = +1/2 or Sx = +1/2 is equally meaningless. 
For a spin-half particle there is a type of information associated with each pair w and —w 
of opposite directions in three-dimensional space, and the different species associated with 
distinct pairs are incompatible. 

In larger Hilbert spaces a quantum sample space or type of information always corre- 
sponds to a decomposition of the identity, a collection of mutually orthogonal projectors 
V = {Vj}, Vj = Vj = Vj^, that sum to the identity I. In the case of an orthonormal basis, 
Vj = \vj){vj\ and {vj\vk) = Sjk, we also write V = since the meaning is obvious. Two 

such collections or types of information V and W are compatible if and only if all projectors 
in one commute with all projectors in the other; otherwise they are incompatible. The "sin- 
gle framework" rule of quantum reasoning [23] generalizes the example discussed above, and 
states that incompatible quantum descriptions (decompositions, information types) cannot 
be meaningfully combined. 



Sx = +1/2 AND Sz = +1/2, 
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Ill Teleportation 
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Figure 1: (a) One bit teleportation. (b) Quantized version, (c) Channel ket construction. 

Let us see how using incompatible types of information assists in understanding why 
quantum teleportation uses two classical bits of information in the standard protocol [24]. 
It is simplest to start with a variant known as "one bit" teleportation [25], corresponding 
to the quantum circuit in Fig. [l](a), where the teleportation process transports the state \ip) 
from the upper left a to the lower right b'. First, a CNOT, shown as a controUed-X (CX) 
gate, acts between qubits a and b, and then an Sx measurement is carried out on qubit a. In 
the figure this measurement is indicated by the Hadamard gate H that interchanges and 
Sz, followed by a measurement in the standard or Sz or "computational" basis indicated by 
the D-like symbol. If the measurement reveals 5*^ = —1/2 a classical bit (dashed line labeled 
x) is transmitted to where it actuates a Z gate on qubit b, whereas if 5*^ = +1/2 nothing is 
done. It is an easy exercise to show that whatever initial state {ip) = a\0) + (3\1) enters at a 
will later reappear at b'. 

In the case of Z information, meaning the input is = |0) or |1), corresponding to 
S*^ = +1/2 or -1/2, the CX (CNOT) gate copies it from the a to the b qubit so that 
Sl = Sz, and the later Z gate has no effect, since even if it acts it only changes the phase of 
|1), leaving the ray (or projector) corresponding to 5*^ = —1/2 the same. Thus failing to do 
the measurement, or throwing away the classical bit, has no influence so far as transporting 
the Z information is concerned. 

In the case of X information the input \ip) is either |+) = (|0) + |1))/a/2 or |— ) = 
(|0) — |1))/a/2, corresponding to Sx = +1/2 or —1/2, and the analysis is somewhat more 
complicated. The CX gate maps \ip) = |+) into the two qubit state | + +) + | ) corre- 
sponding to S^ = S^, and = |— ) into | H — ) + | — \-), S^ = —S^. This means the original 
X information is not present in either qubit by itself, since the corresponding reduced den- 
sity operator is |/, but resides in a correlation between the two. Information residing in a 
correlation is not in itself a quantum effect. One can, for instance, encode a classical bit 
{Ol, 1l} in two coding bits by letting 00 or 11, chosen at random, represent 0^, and 01 or 
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10, again chosen at random, represent 1l- From either coding bit alone one can extract no 
information about 0^ versus 1^, but it is obviously present in the two together through their 
correlation. In the case under discussion the measurement in Fig. [It^a) extracts the value of 

after the CX has acted (note that this is not the original X information), and if this is 
negative the Z gate applied to qubit b changes the sign of S^. The net effect is that at the 
end of the process the value of is exactly the same as that of at the beginning, so the 
X information has also been successfully transmitted from a to b'. 

One could continue to check what happens to other types of information, but that is not 
necessary. The Presence theorem of Sec. IV Al says that once it is known that two suitably 
incompatible types of information, Z and X in the case at hand, are present in the output b', 
all other types of information about the input a are also present, so there is a perfect quantum 
channel from a to b', the desired result for teleportation. In summary, the transmission of 
two (suitably incompatible) types of information is needed to ensure that there is a good 
quantum channel from a to 6'. The CX gate by itself transmits the Z type, while the later 
measurement and the single classical bit carrying its outcome are needed to transmit the 
incompatible X type of information. 

The Presence theorem is a statement about quantum information discussed in fully quan- 
tum terms, so to apply it to the system in Fig. [T]^a) one needs to understand the measure- 
ment and the "classical" bit in quantum terms. This can be done in the manner indicated in 
Sec. IIVI But for present purposes it is convenient to avoid having to introduce the Hilbert 
space of a complicated macroscopic system, by "quantizing" the circuit in Fig. [H^a) so that 
it takes the form shown in (b), with the measurement of the a qubit following the part of 
the circuit where it controls (in the usual quantum sense) a Z gate. (See [26] and pp. 186f 
in [1] for this "trick," based on ideas in [27].) The two circuits in (a) and (b) are equivalent 
so far as teleportation is concerned, but the second is simpler to analyze in fully quantum 
terms. Indeed, the later measurement of qubit a in (b) need not be made at all, which is 
why it is not shown, as its outcome is not used in the protocol. (This discussion continues 
in the latter part of Sec. HVl ) 

In addition, the Presence theorem is stated in Sec. IV Al in the language of entangled kets, 
rather than in terms of the input and output of a quantum channel. One way of connecting 
the two is indicated in Fig. [H^c), where an auxiliary qubit d has been introduced, and 



where {|aj)} and {|aj)} are orthonormal bases of Tia and Tia, is a fully-entangled state. The 
result is a final state |\E') G Ha ® Ha ® Hb, referred to as a "channel ket" in [8] (which see for 
additional details). That there is a perfect quantum channel from a to b' in Fig. [T]^a) or (b) 
is the same as saying that all the information about qubit d is in qubit b' if one uses | \E') , or 
equivalently the reduced density operator obtained by tracing over Hb, to generate 

probabilities for correlations between the two in the usual way. 

An alternative way to associate channels with kets is to use map-state duality [8,28] in 
which an entangled ket 
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on the tensor product of Tia and another space Ti-f is expanded in an orthonormal basis of 
Tia, with the (unnormahzed) expansion coefficients. One can always "transpose" 

to an operator 

M = J2\<j>,){a,\ (4) 
j 

mapping Tia to Tif. In the particular case in which {ip) is maximally entangled, which is to 
say TTf{\ilj){ilj\) is proportional to the identity 1^, the map M is, up to normalization, an 
isometry (a unitary operator from Ha to the subspace MTCa of 'Hf): that is, M'^M = la- 
Conversely, given a map M from Tia to Tij and an orthonormal basis of Tia-, it can be 
expanded in the form (jll), and (I3l) then defines a corresponding entangled state (typically 
not normalized), which when M is an isometry is maximally entangled. Of course, if \ip) 
is given, M depends on the choice of orthonormal basis and vice versa, but the 

Presence theorem is unaffected by the basis choice. While this and the other theorems in 
Sec. |V] can be expressed either in "map" or "entanglement" language, the latter has the 
advantage of being more symmetrical (see remarks in Sec. VI of [8]). The idea of regarding 
the input and output of a quantum channel as corresponding to the tensor product of two 
Hilbert spaces, as suggested by the preceding discussion, is a very natural notion when using 
atemporal diagrams [29], and within the consistent histories approach to probabilistic time 
development [11], where the idea of such a tensor product goes back to Isham [30]. 
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Figure 2: (a) Standard (two bit) teleportation. (b) Quantized version. 

Conventional "two bit" teleportation. Fig. [21 with V2\Bo) = |00) + |11), can be analyzed 
in the same way; the details are left as an exercise. The two classical bits in (a) are labeled x 
and z to indicate that they are essential for correct transmission of the Z and X information, 
respectively; throwing z away will not affect X information, and x is dispensable if only Z 
information is of interest. Neither classical bit, nor the two together, actually contain any 
information in themselves, a consequence of No Splitting, Sec. IV B[ All the information 
is in correlations of the classical bits with the b qubit (more details in [31]). One classical 
bit is needed for each of these two incompatible types of quantum information, but that 
is enough, for the Presence theorem then guarantees that all other species are correctly 
transmitted, so one has a perfect quantum channel from a to 6'. Figure 2(b) is the quantized 
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version of (a), which is convenient for analyzing the situation in quantum terms, and one 
could once again introduce a channel ket (not shown), this time on four qubits, in 
analogy with Fig. [I](c). The Presence theorem can be applied to the channel ket, or one 
can regard the channel entrance as constituting its own Hilbert space. Generalization to the 
teleportation of a qudit (with Hilbert space of dimension > 2) is straightforward: if two 
suitably incompatible types of information are correctly transmitted — each type requires a 
classical "dit" — one has a perfect quantum channel. 

IV Decoherence and Classical Information 




Figure 3: Interferometer with beamsplitters B and B'. 

An application of types of information to a simple case of decoherence is shown in Fig. [31 
where a particle (neutron or photon) enters an interferometer on path d at beamsplitter B 
and, because at an intermediate time it is in a coherent superposition (|e) + \f))/^/2, leaves 
the second beamsplitter B' in channel h rather than g. But if while inside the interferometer 
some interaction with the environment leaves a trace indicating that the particle took path 
e rather than /, or vice versa, the interference effect is lost, and the particle emerges with 
equal probability in g or h. Let Z be the e vs. / "which way" information, and X be the 
(|e) + |/))/-\/2 vs. (|e) — |/))/-\/2 "coherent" information. Decoherence, the disappearance 
of coherence, in this case X information, when Z information about the path resides in the 
environment, illustrates the Exclusion theorem of Sec. El one type of information about Sa 
perfectly present in Sb means that a mutually-unbiased type is completely absent from Sc- 
Two types of information X and Z are said to be mutually unbiased if they correspond to 
mutually unbiased orthonormal bases and {l^fc)}, with equal to 1 divided 

by the dimension of the Hilbert space, independent of j and k. 

To apply this theorem to the situation in Fig. [31 think of the particle that has just passed 
through the first beam splitter as system Sa, and just before it reaches the second beam 
splitter B' as Sc, while Sb is the environment at this second time. (See the discussion in 
Sec. [nil on why one can regard the particle at two different times as two separate systems, 
and how to apply the Exclusion theorem, worded in terms of entangled states, to situations 
with unitary time evolution.) For our purposes it suffices to model Sa and Sc using a. d = 2 
dimensional Hilbert space spanned by |e) and |/) — this is analogous to focusing on the 
spin of a particle when its other degrees of freedom are not relevant to the analysis. The 
Exclusion theorem says that when the Z or which-way information about Sa is perfectly 
present in the environment, i.e., at the time the particle reaches the second beam splitter, 
the (mutually unbiased) X or coherence information must be perfectly absent from Sc, i.e., 
from the particle itself at this later time. And in the absence of coherence all interference 
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effects disappear: the situation after the second beam sphtter is, statistically, just the same 
as if the particle arrived at random on path e or path /. All this is well known, and the 
connection between decoherence and information in the environment has been previously 
pointed out by Zurek and his collaborators [32-35]. The use of types of information, not tied 
to some notion of measurement [36], is our attempt to add further clarity and precision to 
these seminal ideas. 

The situation to which the Exclusion theorem applies is that of strong, meaning essen- 
tially complete, decoherence. Clearly extensions are needed (Sec. IVl|) to cases of only partial 
decoherence. Nevertheless, strong decoherence is a useful idealization both because if is often 
a good approximation to what is realized in the laboratory (to the dismay of those who want 
to build quantum computers), and because it yields a precise definition of another idealiza- 
tion, classical information. Indeed, it is rather odd to find the term "classical information" 
floating around in technical books and articles on quantum information theory when most, 
even if not all, physicists believe that all physical processes in the real world are quantum 
mechanical, with classical physics a good approximation in appropriate circumstances, but 
hardly part of our fundamental understanding of nature. A good way to see how "classical" 
information can arise in quantum mechanics is to note that one consequence of the Trun- 
cation theorem as discussed in Sec. IV Bl is the fact that if a particular type of information 
about Sa associated with an orthonormal basis is perfectly present in Sb it is the only 

type of information about Sa which can be present in a third system Sc in the sense that 
any other species of information is parasitic upon, or controlled by, or compatible with the 
{l^j)} type. Whenever only one type of information needs to be considered all the rules of 
classical information theory apply to it; conversely, "classical information" in the quantum 
context refers to the single dominant type of quantum information available in a situation of 
strong decoherence. Typically it is the presence of this type of information in the environ- 
ment that means that other types can be ignored in systems which are not isolated from the 
environment. In particular, the measurements indicated in part (a) of Figs. [T] and [2] when 
instantiated in physical apparatus amplify a particular type of information, and the environ- 
ment rapidly copies the "pointer positions," resulting in strong decoherence. To avoid the 
rather unwieldy task of trying to describe this amplification process and interaction with the 
environment in correct quantum mechanical terms, which is certainly possible in principle, 
it is often preferable (as noted earlier) to employ a simple quantum circuit in which the 
decoherence is "built in": the a qubit in Fig. [Tt^b), and the a and c in qubits in Fig. [2t^b), are 
at later times good copies of the Z information preceding the final control gates, and since 
no further use is made of them, they may be regarded as carrying this type of information 
off into the environment. 

V Theorems 

In this section we state and prove results used in the preceding sections, plus some 
additional ones that are closely related. The treatment builds upon ideas and terminology 
from [8], repeated here to the extent needed to make the exposition self-contained. Note in 
particular that Ha is the Hilbert space of system Sa, 'Hab = 'Ha®'Hb that of Sab, the systems 
Sa and Sb regarded as a single system, pab is a density operator on Tiab, often traced down 
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from that of a larger system, da the dimension of Ti,a, and so forth. All Hilbert spaces are 
assumed to be of finite dimension in order to avoid technical complications. 

V A Presence 

Theorem (Presence). Let Sa and Sb be two quantum systems with Hilbert spaces Tia 
and Tib, o,'>T'd V = {Vj} and W = {Wk} two strongly incompatible projective decompositions 
of the identity la- If both the V and the W information is perfectly present in Sb for a density 
operator p on Tiab (possibly a pure state), then all types of information about Sa are perfectly 
present in Sb- 

The terms are to be understood as follows. The density operator p on Tiab or pure state 
1^') e Tiab will be called a pre-probability using the terminology of Ch. 9 of [11], because it 
can be used to generate probabilities once a quantum sample space — an orthonormal basis 
of Tiab or a decomposition of the identity lab — has been specified, following the usual rule 
that the probability associated with a projector P is 

Pr(P) = (P) = Tr(Pp) = (^|P|^), (5) 

with p = |^E')(^'| for a pure state. For example, p as a density operator is represented by 
different matrices if different orthonormal bases are chosen. The diagonal elements of one 
of these matrices form a probability distribution associated with the corresponding basis 
(or type of information), whereas the single density operator giving rise to the different 
distributions is the pre-probability. 

The V type of information is perfectly present in Sb for a given pre-probability (Sec. Ill 
C of [8]) when the unnormalized conditional density operators 

Pb3 = {V^p) (6) 

on Tib are mutually orthogonal, i.e., 

Pbj Pbk ^ for j ^ k. (7) 

In the language of measurements, if one thinks of carrying out a projective measurement 
on Tia corresponding to {Vj}, then there is a corresponding decomposition {T^} of lb such 
that the measurement outcomes are in one-to-one correspondence. An analogous definition 
applies to W information. The conclusion of the theorem, that all species of information 
about Sa are perfectly present in Sb, conveniently abbreviated to "all information about Sa 
is in Sb," means that for any decomposition of the identity, in particular for any orthonormal 
basis of Tia, that kind of information is perfectly present in Sb in the sense just discussed. 
When the pre-probability is a pure state |\1'), this implies it is maximally entangled, i.e., pa = 
Trb(|\l/)(\1'|) is proportional to the identity /„. For a general p a similar but more complicated 
result obtains — see theorem 3(ii) in [8] — and once again pa = Ttb{p) is proportional to /„. 

The decompositions V and W are said to be strongly incompatible (Sec. IV of [8]) when the 
only projector P that commutes with every Vj and every Wk is either P = or P = /„. While 
concise, this definition is not very intuitive. In the case of orthonormal bases V = {l^'j)} and 
W = one can use a somewhat simpler definition. Construct a graph containing 2da 

nodes, one for each \vj) and one for each \wk). Whenever the inner product {vj\wk) is nonzero. 
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draw an edge between the corresponding nodes. Then V and W (i.e., the corresponding 
collections of projectors) are strongly incompatible if and only if this graph is connected. 
The proof is at the end of App. |Al In the case of two mutually unbiased bases, as in Sec. HVt 
every is connected to every {|wfc)} node, so connectivity of the graph is obvious. It is 

equally obvious for two bases in which {vj\wk) is never zero. However, strong incompatibility 
can still hold if some of the {vj\wk) are zero, provided the graph remains connected. 
The proof of the Presence theorem, extending a weaker theorem in [8], is in App. |X] 

V B Truncation, Exclusion, No Splitting, Somewhere 

A series of useful "all-or-nothing" results about information in three systems Sa, Sb, and 
Sc begins with: 

Theorem (Truncation). Let Sa, Sb and Sc be three quantum systems, and suppose that 
for some decomposition V = {Vj} of la and for some density operator p on Tiabc all the V 
information about Sa is present in Sb- Then any other type of information W = {Wk} about 
Sa will be "truncated" (or "censored") in the sense that 

Pac = ^VjPacVj, (8) 

j 

that is, pac, the partial trace of p over Tib, commutes with all the Vj. (Note that Vj is here 
understood as Vj ®Ic on Tiac-) Equivalently, all correlations between Sa and the third system 
Sc satisfy 

{AC) = (AC) (9) 

for any operators A and C on Tia ad Tic, respectively (one could write A ® C in place of 
AC), with 

A = J2V,AV,. (10) 

j 

the truncated version of the operator A, and { ) the average taken with respect to p, as in 

m- 

This theorem is closely related to, but not the same as, theorem 6(i) in [8], and its proof 
is in App. [HI Since any operator A can be written as A = J2jk '^j^'^k, its truncated version 
A is obtained by throwing away the off-diagonal blocks. To understand the implications of 
the theorem it helps to consider the case in which V = {\vj)} is a.n orthonormal basis of Tia, 
so that 

A = J2^v,\A\vj)\v,){vj\ (11) 
i 

is diagonal in this basis, meaning that all correlations between A and C can be computed 
from the correlations (VjC), that is from V information about Sa in Sc. Equivalently, 

Pac = J2\Vj){Vj\0Tj, (12) 

j 

where the Tj are operators on Tic. All other information about Sa in Sc, of whatever kind, is 
then "parasitic upon," "truncated by," or "censored relative to" the V information. When 
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the Vj projectors have rank greater than 1 the truncation or censorship is less extreme, but 
it remains true that the only sort of information about Sa allowed in Sc is represented by 
A-type operators which are compatible with V in the sense of commuting with every Vj or, 
equivalently, pac commutes with every Vj. 

The situation is particularly clear if there is another basis W = {1^^)} which is mutually 
unbiased with respect to V = i.e., \{wk\vj)\^ = l/da for all j and k. In that case the 

truncated projectors Wk are not only diagonal in the V representation, but are all equal to 
la/da independent of k, proving the next theorem (which is the same as theorem 7(ii) in [8]): 

Theorem (Exclusion). Let Sa, Sb and Sc be three quantum systems, and V = {Ivj)} 
and W = {l^fc)} two mutually unbiased orthonormal bases ofTia- Then if the V information 
about Sa is perfectly present in Sb, the W information about Sa is perfectly absent from Sc- 

The perfect absence of some type of information can be defined using reduced density 
operators, as in ([6]), but now they are required to be the same up to a multiplicative constant. 
That is, the W or {Wk} information about Sa is perfectly absent from Sc if and only if for 
every k 



where pc = TTa{pac), Pac is the density operator (pre-probability) on Hac, and the pk are 
nonnegative numbers. One can think of this in terms of measurements as saying that when 
a projective measurement corresponding to {Wk} is made on Sa, the probability of any 
measurement on Sc conditioned on the outcome k will be independent of k. 

Below we will need the notion of the perfect absence of all types of information about Sa 
from Sc, conveniently abbreviated to "no information about Sa is in Sc" This is equivalent 
to Pac = Pa®Pc, or to = I7) for a pure state, theorem l(iii) of [8]. As the relationship 
is obviously symmetrical, one can also say that Sa and Sc are uncorrelated. 

An important corollary of the Exclusion theorem is: 

Theorem (No Splitting). Let Sa, Sb and Sc be three quantum systems. If all types of 
information about Sa are perfectly present in Sb, then all types will be perfectly absent from 
Sc. That is, if all information about Sa is in Sb, no information about Sa is in Sc. 

This is theorem 8(i) in [8]. It follows at once from the Exclusion theorem, because to show 
the absence of some species of information about Sa in Sc it suffices to consider orthonormal 
bases, and for each of these we know that there is at least one mutually unbiased basis for 
which all the corresponding information is, by hypothesis, perfectly present in Sb- The No 
Splitting theorem has lots of applications. For example, in either one or two bit teleportation 
after the final corrections have been made, there is no information about the input state lip) 
remaining in the environment treated as a quantum system, and since copies of the classical 
bits X and y used to complete the protocol can remain in the environment, it is evident 
that they, as has often been observed, can contain no information about the input: their 
probabilities cannot depend upon \ip). In the case of quantum codes the presence of the 
encoded information in some subset of the coding bits (which is what makes error correction 
possible) means its absence from the complementary subset of coding bits, and this can 
provide additional intuition about the coding process [8]. 

Is there a converse to the Exclusion theorem which says that if the W information about 
Sa is perfectly absent from Sc, then that associated with any mutually unbiased basis V 
must be present in iS;,? No, not even if one knows that all information about Sa is present 




(13) 
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in the combined system of Sbc, see the end of App. [B]for a counterexample. There is, on the 
other hand, a partial converse of the No Splitting theorem: 

Theorem (Somewhere). If for a pure state pre-probability |\1') on Tiabc it is the case 
that all the information about Sa is in the combined system She, and none of it is in Sc, then 
it is all in Sb- 

The name "Somewhere" comes from the idea that if we know that an object is in one 
of two rooms and it is not in the second, it has to be in the first: it must be somewhere. 
However, information is very different from a lost child, as it can be present in correlations 
between two systems, while not being available in either system by itself. See, for example, 
the discussion of X information in one bit teleportation in Sec. Illll Consequently, the 
Somewhere theorem is a decidedly quantum mechanical result. Also, it fails (in general) 
if the pure state is replaced by a density operator. The theorem itself is proved in [8] as 
theorem 8(ii), where one will also find an application to quantum codes. 

V C Absence 

Given the Presence theorem one might anticipate a similar Absence theorem. It comes 
in two versions: 

Theorem (Absence). Let Sa and Sb be two quantum systems. 

i) Simple version. If the pre-probability is a pure state |\Ef) on Tia^Tib and the informa- 
tion associated with a single orthonormal basis {\vj)} of Tia is completely absent from Tib, 
then |\[') is a product state of the form \a) ^ \b) , so there is no information about Sa in Sb 
or vice versa; the two are uncorrected. 

a) Complicated version. Let the pre-probability be a general density operator on l-La®'Hb, 
and let {V"*-™'-'} be a collection of decompositions of the identity la ofHa, 

h = Y.^t^ (14) 

i 

for each m, where the V^™"^ for different j are the projectors belonging to V^"^\ If the 
collection {vj"^^} for all m and all j spans the space Ha of operators on Ha, and if each 
species V^"^^ of information about Sa is completely absent from Sb, then 

P = Pa®Pb, (15) 

so there is no information about Sa in Sb or vice versa; the two are uncorrelated. 

The proof of both versions is given in App. O For version (ii) the conditions are definitely 
more complicated than for the Presence theorem: if Ha has dimension da, one needs to check 
not two but at least da + 1 orthonormal bases (see end of App. [C]) in order to be sure that all 
information is absent. For instance, if Sa is a qubit, da = 2, it suffices to check that the X, 
y, and Z types of information are absent, but two out of the three is not enough, as shown 
in the example at the end of App. [Bl 

V D No Cloning 

One might suspect that the No-Splitting theorem is the same as, or at least closely 
related to, the well-known no cloning result [9]. However, the two seem to be different, since 
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neither the conditions nor the consequences of no-cloning are expressed in terms of types of 
information as used here. The following theorem is the closest we have been able to come 
in finding a connection between the two. 

Theorem (Generalized No Cloning). Let M be an isometry from Tia to Tibc, (ind {\otj)}, 
with j lying in a finite index set J, a collection of normalized kets on Tia with the property 
that the pairs {j,k) for which the inner product {aj\ak) is nonzero when treated as edges 
produce a connected graph on the set J. Assume that for each \aj) its image under M is a 
product state 

M|a,) = \(3,) ® |7,), (16) 
where both and \ are normalized, and that 

\{c^j\c^k)\ = mm (17) 

whenever the left side is nonzero. 

Under these conditions M restricted to the subspace Qa spanned by the {\otj)} is of the 

form 

M|a) = (f/|a)) ® |7i), (18) 

where U is a unitary map of Qa onto the subspace Qb of Tib spanned by the and |7i) 

is a fixed ket in He- 

The proof is in App. [Dl The connection with no-cloning, not obvious given the somewhat 
abstract statement of the theorem, is the following. Suppose j takes on just two values 1 and 
2, the states \ai) and \a2) are linearly independent, and (ai|a2) 7^ 0. Imagine these are two 
states to be cloned, and the isometry M (which can be replaced with a unitary acting on the 
tensor product of Ha and an additional space Hs initially in a state |so)) is supposed to carry 
out the cloning process. If and |/52) are good copies up to some unitary transformation 
of Tib, their inner product must equal (Q;i|a2) apart from an unimportant phase. As the 
conditions of the theorem are fulfilled — the graph consists of two nodes joined with the edge 
(1,2) — it follows that M is not only unable to produce additional copies in Q^, but in fact 
there is no information at all in Qc which would allow distinguishing the states and 
|a2). Thus at least for the subspace Qa (which could be all of Ha if the span of is 
large enough) one arrives at the same conclusion as with the No Splitting theorem, but using 
somewhat different hypotheses. 

VI Conclusion 

Identifying types or species of quantum information and noting when they are compat- 
ible (i.e., the projectors commute) or incompatible looks like a promising approach to the 
foundations of quantum information for the following reasons. First, it allows a more in- 
tuitive, as well as a fully consistent, approach to quantum probabilities at the microscopic 
level, in contrast to the usual textbook approach, with its preparations, measurements, and 
"great smoky dragon" [37], long known to provide an awkward and difficult (and internally 
inconsistent [38,39]) way of thinking about the quantum world, however effective it may be 
as a calculational tool for the final outcomes of measurements. Second, the ideas of classical 
information theory [3] are directly applicable to quantum systems as long as one restricts 
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oneself to a single type of quantum information, or to two or more compatible types (which 
can then be combined to form a single type), because there is a properly defined sample 
space on which probabilities of quantum events and processes, and their correlations, satisfy 
the standard rules of probability theory, which are fundamental to the structure of informa- 
tion theory as developed by Shannon and his successors. Note that it is not necessary to 
restrict oneself to macroscopic systems or asymptotically large N (number of transmissions, 
or whatever) limits. 

Third, the existence of different incompatible species of quantum information is at the 
heart of the objections raised in [4] to extending Shannon's theory to the quantum domain. 
Recognizing the role of quantum incompatibility and using different information types gets 
around these problems and allows a fully consistent formulation of the microscopic statisti- 
cal correlations needed to properly begin the "quantization" of classical information theory. 
Fourth, one of the principal ways quantum information goes beyond its classical counterpart 
is in its discussion of how incompatible types of information relate to, or so-to-speak con- 
strain, each other for a given setup, or quantum circuit, or entangled state. The Presence, 
Truncation, and Absence theorems and their various corollaries in Sec. |V] clearly do not 
belong to the domain of classical information, since their very formulation requires reference 
to noncommuting operators, the hallmark of "quantum" effects. 

The approach presented here provides, we believe, new intuitive insight into the processes 
of teleportation and decoherence, and into how "classical" information can be consistently 
described as a quantum phenomenon. It is obviously incomplete in two respects. First, the 
theorems of Sec. |V] are of the "all or nothing" variety: they apply to extreme situations in 
which information is either completely present or completely absent. Obviously it would 
be valuable to have quantitative extensions of these theorems, presumably in the form of 
inequalities, that apply to situations where information of different kinds is partially present 
or absent. Finding suitable information measures and proving appropriate bounds looks like 
a challenging problem, but one that needs to be addressed given that one is often interested 
in situations where there is noise, so different types of quantum information will be degraded 
in different ways. There are, of course, many inequalities involving quantum information in 
the published literature, and some of them, such as those of Hall [40,41], look as if they can 
be reformulated to apply to different species of information as discussed here. 

Second, the examples and theorems given in this paper (and their extensions beyond 
"all or nothing" noted above) need to be generalized to cases in which microscopic quantum 
properties are considered at a large number of successive times, as in the case of "quantum 
jumps" [42-44]. For this purpose it is likely that the full machinery of quantum histories [11] 
will be needed in order to provide consistent probabilistic descriptions without having to 
invoke the awkward concepts of macroscopic "preparation" and "measurement," which are 
obviously not a fundamental part of microscopic quantum mechanics. 
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A Presence Theorem 



The Presence theorem of Sec. |V]is an extension of theorem 4 of [8], where it was shown 
to hold when p is a pure state on Tiat- Our task here will be to remove that restriction and 
prove it for a general density operator p = pab, where subscripts have been added to avoid 
any confusion in the following argument. To that end first "purify" pab by introducing an 
auxiliary system He and a ket 

on Habe, with {|cq)} an orthonormal basis of He, and coefficients {iV'g)} chosen so that 

p,, = Tr,(|<|.)(<|.|) = 5^pJ^,)(^J. (A.2) 

Lemma Suppose that the V = {V^} information about Sa is perfectly present in Sb for 
Pab- Then that is also true for every \ipg) in (lA.ip for which pg > 0. 

To show this, insert flA.2p in place of p on the right side of so each pbj is a sum 
over q of positive operators pbqj. Then in order that ([7]) hold it is necessary (and obviously 
sufficient) that PbqjPbq'k = 0, for all q and q', whenever j ^ k. Setting q' = q gives the 
desired result. Now apply the lemma to both the V and the strongly incompatible W type 
of information. Since for one of the pure states \ijjq) on Hab both types of information about 
Sa are in Sb, theorem 4 of [8] tells us that for this pure state all information about Sa is 
present in Sb- This implies (and is implied by, see theorem 3 (i) of [8]) that \ijjq) is maximally 
entangled, or 

T:ib{\i^q){^q\)=Ia/da- (A.3) 

Next argue that the C = {\cq){cq\} type of information about Se is absent from Sa by 
showing that the conditional density operators 

^T^c{Pac\Cq){Cq\) = Pq{Ia/da) (A.4) 

depend on \cq) only through the numerical factor pg] see (fT3l) with systems appropriately 
renamed. Indeed, (I A. 41) follows from (lA.ll) when one replaces pae with Trfe(|$)($|) and 
interchanges the order of partial traces. The same argument applies if we use any other 
choice of orthonormal basis of He for the expansion flA.ll) : as is well known, changing that 
basis does not alter the density operator p = pab we began with, but simply expresses it in 
terms of a different ensemble. Consequently, all types of information about Se are absent 
from Sa, which is to say pae = Pa® pc, theorem 1 (iii) of [8], and thus all types of information 
about Sa are also absent from Se- 

One more step is needed. The presence of V information about Sa in Sb means it is also 
present in Sbe- (This is intuitively obvious, but can also be shown formally from the definition 
in (35) of [8], where one simply replaces with B^®Ie- Or one can use the definition in ([7]) 
of the present paper, along with the fact that if P and Q are positive operators on He ®Hf 
with PQ 7^ 0, then PeQe 7^ 0, where Pg and Qe are partial traces over Hf — use the spectral 
representations and take traces.) Of course the same is true of the W information. Hence, 
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applying theorem 4 of [8] to the bipartite system consisting of Sa on the one hand and She 
on the other, with |$) an entangled ket on Ha ® Hbc, we conclude, from the presence of two 
strongly incompatible types of information, that all the information about Sa is present in 
the combined system She- From this and the absence of all information about Sa from Sc 
demonstrated earlier, it follows from theorem 8 (iii) of [8], using |$) as a pure state pre- 
probability on Tiabc, that all information about Sa is in Sb- This completes the proof of the 
theorem. 

Next we will show that two orthonormal bases V = and W = are strongly 

incompatible if and only if the graph described in Sec. IV Al is connected. First, assume V 
and W are not strongly incompatible, so there is a projector P that commutes with every 
\vj){vj\ and every \wk){wk\, and is neither nor /, so projects onto a proper subspace V 
of the Hilbert space. Then some subset of the are inside V and the rest are in its 

orthogonal complement P"*", for otherwise V would not commute with them, and the same 
is true of the {\wk)}- Evidently there can be no nonzero {vj\wk) for a \vj) (or \wk)) in V and 
a \wk) (or \vj)) in V'^. Consequently the graph cannot be connected, as it has at least two 
components, one for V and one for P"*" vertices. 

For the converse, assume the graph is not connected, and renumber the vertices so that 
the (Ifj)} with 1 < j < m < da are all the v vertices in one connected component C of the 
graph, with the others in its complement. The projector 

m 

obviously commutes with all the and in addition commutes with every \wk){wk\ when 

\wk) is not in C, as otherwise there would be an edge from some \vj) with j < m to this \wk). 
Now apply the same argument with / — P in place of P to show that I — P commutes with 
every \wk){wk\ when \wk) is in C. Since P and I — P commute with the same things, we 
have shown that P commutes with all the \wk){wk\ as well as the \vj){vj\. As P is neither 
nor /, V and W are not strongly incompatible. 



B Truncation Theorem and Related 

If the V = {Vj} information about Sa is perfectly present in Sb, there is, as noted following 
([7]), a decomposition {T^} of lb such that 

{V,Tk) = 6,k{V,). (B.l) 

(Note that VjTk is V,- Tfc ® Ic-) The equivalence is shown in [8], where in fact flB.l|) is the 
primary definition. If the pre-probability defining () is a pure state G Ti-abc, flB.ip implies 
that 

V,Tk\<f ) = foi J =^ k. (B.2) 
As a consequence, and using the fact that la = J2j h = Tlk '^k, we have 

V,\^) = V,T,\^) = T,\^), (B.3) 
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and 



Therefore it foUows that 

{AC) = {^\[j2VjT,ACV,n]\^) = {^\[J2V,AV,C]\^), (B.5) 



jk 



which is ([9]) when the pre-probabihty is a pure state. In (IB.SP we have used the fact that Tj 
commutes with AC, TjT^ = SjkTj, and (lB.4p . 

To extend the argument to a general density operator p on Habc, introduce a fictitious 
system Hr, purify p to a ket G Hater, and apply (IB.Sp to the three part system consisting 
of 5a, Sb, and in place of Sc the combined system iScr- The significance of Vj, Tj, and A is 
the same as before, while C can be replaced by any operator on Tier- If in particular we use 
C ® Iti the result is ([9]). The equivalence of ([8]) and ([9]) is a straightforward exercise when 
one notes that {AC) = Tr^ACpac), and that ([9]) holds for all A and C (operating on Ha and 
He)- This completes the proof. 

The following example shows that the Exclusion theorem does not possess a simple 
converse of the type mentioned in Sec. IV B[ The entangled state 

2|^) = |000) + |011) + |100) - |111) (B.6) 

on Habc, with qubits in the order \abc), has the property that all information about Sa is 
present in the combined system Sbc, the X information about Sa is perfectly present in both 
Sb and in Sc, whereas both the y (basis {(|0) ± i\l))/\/2}) and the mutually unbiased Z 
information about Sa are perfectly absent from both Sb and from Sc- Perhaps the easiest 
way to check this is to expand |^) in the X, y, and Z bases of Ha in turn, and look at the 
coefficients in Hbc- That all the information about Sa is in Sbc follows from the observation 
that any one of these expansions (and therefore all three) is in Schmidt form with Schmidt 
coefficients of equal magnitude, so theorem 3(i) of [8] applies. Note that we have an example 
in which if Sa and Sb are considered two parts of a bipartite system with pre-probability given 
by the density operator Trc(|^)(^'|), one would be mistaken to suppose that the absence of 
the two mutually unbiased types of information 3^ and Z about Sa from Sb implied the 
complete absence of all information. This confirms the remarks at the end of Sec. IV CI 



C Absence Theorems 

For part (i), write |\E') in the form 

\^) = j2\'^j)^m, (c.i) 

where the \f3j) are expansion coefficients. The fact that the or information 

about Sa is absent from Sb means that the 1(3 j) are all proportional to one another, thus 
multiples of \Pi), assuming it is nonzero. Inserting \Pj) = Cj\Pi) in (IC.1|) shows that |\E') = 



|a) ® \Pi) is a product state, so no information about Sa is in Sb or vice versa. 
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To prove (ii) we employ an orthonormal basis {Qr}, < r < rf^ — 1 of the space Ha of 
linear operators on Ha, in the sense that 

{Qr, Qs) := {l/da)TTa{QlQs) = Srs, (C.2) 

with Qo = la, and thus TiaiQr) = for r > 0. Expand p as 

p={l/da)Y,Qr^Br, (C.3) 
r 

where, see (]C.2I) . the expansion coefficients are given by 

Br=TTa{QrP), (C.4) 

with 

Bo = Tr,(p) = Ph. (C.5) 
The absence from Sb of each species V^"^^ = {vj"^^} of information about Sa means that, see 

TlaiVj^'^p) = PjmPb = PjmBo, (C.6) 

where the pjm are nonnegative constants. By hypothesis, the collection {vj"^^} for all m and 
all j spans the operator space Ha, so any Qr can be written as a sum 

Qr = ^ ^ CrjmVj \ (C.7) 
jm 

with suitable coefficients Crjm- (These may not be unique, but that does not matter.) Insert 
flC.7p in flC.4p and use flC.6p to conclude that every Br is a multiple of Bo, and therefore 
P = Pa ® -So is a product. 

The need for at least da + 1 orthonormal bases of Ha in order to check that all information 
about Sa is absent from Sb can be seen in the following way. Each basis of Ha gives rise to da 
orthogonal, and hence linearly independent, operators in the (i^-dimensional space Ha- But 
these da projectors sum to the identity / for each such basis, and therefore u such bases will 
give rise to at most i^lda — 1) + 1 linearly independent operators, which is when u = da + 1- 



D Generalized No Cloning 

From f|T6l) one sees that for every j and k in J, 

I(7,l7.)l ■ mm = |(«,|MtM|a,)| = \{a,\a,)\, (D.l) 
and therefore, in view of (1171) . 

I(7,l7fc)l = l (D.2) 
whenever j ^ k and |(a;j|ajt)| 7^ 0. The pairs (j, k) for which flD.2p holds form, by hypothesis, 
a connected graph on J, which means that the normalized kets |7j) are identical apart from 
phase factors, so each is a multiple of just one of them; let us say |7j) = e*'^j |7i). Replace 
the right side of (fT6!) with ® where = e^'^^\l3j), and define the linear operator 
U on Qa so that 

U\a,) = (D.3) 
Since = \ {Pj\Pk) \ = K^jlttA:)!, U is unitary as a map from Qa to Qb- 
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